@gen#meA 

Journals.ASM.org 

Genome Sequence of Escherichia coli 0157:H7 Strain 2886-75, 
Associated with the First Reported Case of Human Infection in the 
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First identified in 1982 as a human pathogen, enterohemorrhagic Escherichia coli of the 0157:H7 serotype is a major cause of 
food-borne acquired human infections. Here, we report the genome sequence of the first known strain of this serotype isolated 
in the United States. 



Received 25 November 201 3 Accepted 2 December 201 3 Published 9 January 201 4 

Citation Sanjar F, Hazen TH, Shah SM, Koenig SSK, Agrawal S, Daugherty S, Sadzewicz L, Tallon U, Mammel MK, Feng P, Soderlund R, Tarr PI, DebRoy C, Dudley EG, Cebula TA, 
Ravel J, Fraser CM, Rasko DA, Eppinger M. 2014. Genome sequence of Escherichia coli 01 57:H7 strain 2886-75, associated with the first reported case of human infection in the 
United States. Genome Announc. 2(1 ):e01 1 20-1 3. doi:l 0.1 1 28/genomeA.01 1 20-1 3. 

Copyright © 201 4 Sanjar et al. This is an open-access article distributed under the terms of the Creative Commons Attribution 3.0 Unported license. 
Address correspondence to Mark Eppinger, mark.eppinger@utsa.edu. 





Since the initial report in 1982 that Escherichia coli 0157:H7 is 
associated with severe human disease, the serotype 0157:H7 
has assumed a position of dominance among enterohemorrhagic 
E. coli (EHEC) serotypes in North America responsible for global 
widespread outbreaks of severe gastrointestinal disease (1,2). This 
lineage of Shiga toxin-producing E. coli (STEC) 0157:H7 is 
non-sorbitol fermenting and jS-glucuronidase negative and has 
evolved from an 055:H7 progenitor (3, 4). The isolation in 1975 
of this E. coli 0157:H7 strain, designated 2886-75, from an adult 
with hemorrhagic colitis (HC) (5, 6) predated the 1982 Oregon 
and Michigan hamburger-associated E. coli 0157:H7 outbreaks 
(6). Since 1982, this serogroup has emerged as the dominant cause 
of EHEC infections in North America. Infections typically present 
with symptoms of bloody diarrhea coupled with severe abdominal 
pain (5, 6) but can rapidly progress to life-threatening complica- 
tions, such as hemolytic uremic syndrome (HUS), HC, and cen- 
tral nervous system failure (7-12). 

Genomic DNA was subjected to Illumina sequencing using 
paired-end libraries with 300-bp inserts on the HiSeq platform. 
The draft genome was assembled with Velvet assembler (13, 14), 
and the IGS Annotation Engine and Manatee were used for ge- 
nome annotation and visualization (15). Availability of the 
high-quality genome sequence enabled the determination of the 
pathogenome virulence state (16) and phylogenomic grouping 
according to established genotypic classification methods using 
in-silico and experimental assays (17-20). PCR genotyping con- 
firmed the stx genotype and determined the occupancy of both the 
yeh Vand wrbA bacteriophage insertion sites (21). Strain 2886-75 



has an unusual genotype. Unlike the majority of E. coli 0157:H7 
recovered from humans in the United States (22-24), this isolate is 
stx 1 positive and stx 2 negative. The yehV site is occupied by the stx 1 
bacteriophage that is not stably integrated. Hence, the genomic 
architecture does not fit the emergence scenario typical of other 
human-pathogenic E. coli 0157:H7 strains, and this isolate cannot 
be placed into clusters 1, 2, or 3 (25). However, this strain shows 
other typical genetic hallmarks of EHEC. Strain 2886-75 carries 
the lineage-specific virulence plasmid p0157 (26, 27), the T allele 
of the translocated intimin receptor (fir) (255 T>A), and a chi- 
meric polymorphic variant of repeat region 1 (RR1) with the ab- 
sence of the repeat regions RR2 and RR3, placing strain 2886-75 
closest to group 8 (28). Multilocus sequence typing (MLST) (18) 
based on the nucleotide sequences of 15 housekeeping genes re- 
vealed that 2886-75 exhibits allele combination 23.11 (19) and 
belongs to the sequence type 11 (ST11) and complex/ ABD group 
(18, 20). Strain 2886-75 is a representative of lineage I (17, 29) and 
clade 3.16 (30). The genome sequence presented here will be a 
valuable resource in studying E. coli 0157:H7 pathogenome evo- 
lution by comparing this isolate to the extant genotypes and will 
aid in the development of a higher-resolution phylogenomic 
framework for improved molecular-guided pathogen surveillance 
and outbreak investigations (10, 11,31). 

Nucleotide sequence accession number. This genome se- 
quence is deposited in GenBank under the accession number 
AVRR00000000. A bacterial strain culture is available from the 
Biodefense and Emerging Infections Research Resources Reposi- 
tory (http://www.beiresources.org/). 
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